Improved Metadata Access During Error Handling Routines 



TECHNICAL FIELD 

[1] The present invention relates generally to backup and disaster recovery services 
for a data storage system and, in particular, to improving the efficiency of error 
handling routines following the interruption of a data copy operation. 

BACKGROUND ART 

[2] High end storage controllers, such as the International Business Machines 
Corporation (IBM®) Enterprise Storage Server® manage Input/Output (I/O) requests 
from networked hosts to one or more storage units, such as a direct access storage 
device (DASD), Redundant Array of Independent Disks (RAID Array), and Just a 
Bunch of Disks (JBOD). Storage controllers include one or more host bus adapters 
or interfaces to communicate with one or more hosts over a network and adapters or 
interfaces to communicate with the storage units. 

[3] Data integrity is a critical factor in large computer data systems. Consequently, 
backup systems have been developed and integrated into storage controller to 
prevent the loss of data in the event of various types of failures. Backup systems 
provided by IBM, known generally as "copy services", include Peer-to-Peer Remote 
Copy, FlashCopy® and Extended Remote Copy and maintain a separate, consistent 
copy of customer data. As illustrated in Fig. 1, in a storage system 100, data 
generated by a host device 110 is transmitted to a primary storage unit 120 for 
storage on associated storage devices 130. A copy of the data is also transmitted, 
such as over a fibre channel network 140, and to a secondary storage unit 150 for 
storage on associated storage devices 160. Because of the flexibility of network 
interconnections, the primary and secondary units 120 and 150 may be physically 
located remote from the host 110. And, for additional data security, the primary and 
secondary units 120 and 150 may be (but need not be) physically located distant 
from each other, thereby reducing the likelihood of a single disaster simultaneously 
harming both the primary and secondary units 120 and 150. It will be appreciated 
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that the primary and secondary units 120 and 150 may be the same physical unit, 
divided logically into two. 

[4] Due at least in part to the risk of a power loss or other comparable significant 
event while customer data is being copied to the secondary unit, the state of the 
copy services operation is stored in memory and updated as the copy services 
operation progresses. The state data (as well as other control information used 
intemally by the storage controller), known as "metadata", is periodically destaged 
from the memory to reserved areas of the customer storage devices 130. 
Preferably, the metadata is divided into tracks of, for example, 8 KB each. There 
may be as many as 2000 or more such tracks. 

[5] During an error handing routine or behavior (EHB), such as an internal 
microcode load (IML), following a power loss during a copy services operation or 
other comparable significant event, the metadata is staged from the storage device 
to the memory where it becomes available for the recovery operation. In a 
conventional EHB, other EHB activities must be paused while all of the metadata 
tracks are staged to memory. Only after all of the metadata has been staged may 
the EHB be completed and normal customer I/O operations resumed. 
Consequently, the requirement to stage all of the metadata tracks delays completion 
of the EHB and adversely impacts customer I/O. 

[6] Consequently a need remains for improving the performance of metadata 
recovery during EHB activities without adversely affecting customer operations. 

SUMMARY OF THE INVENTION 

[7] The present invention provides a method for initializing a data storage controller. 
Following commencement of an IML, copy state data tracks are background staged 
from a disk storage device to a memory device. If a request is received to access a 
track of copy state data and the track has been staged, the track is accessed. If the 
requested track has not been staged, requester waits while the requested track is 
staged; then the requested track is accessed. Preferably, completion of the IML is 
independent of the completion of the staging of copy state data tracks. 
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[8] The present invention further provides methods for processing metadata in a 
storage controller. During a copy services operation, the current state of the 
operation is maintained in a memory device. Periodically, the current state is 
destaged to metadata tracks on a storage device. Following commencement of an 
error handling routine, copy state data tracks are background staged from a disk 
storage device to a memory device. If a request is received to access a track of 
copy state data and the track has been staged, the track is accessed, if the 
requested track has not been staged, the requester waits while the requested track 
is staged; then the requested track is accessed. Preferably, completion of the error 
handling routine is independent of the completion of the staging of copy state data 
tracks. 

[9] The present invention further provides a data storage controller, including a 
memory device for storing a current state of a copying operation as metadata tracks 
and means for processing an error handling routine. The means for processing an 
en-or handling routine Includes means for Initializing a parameter of each metadata 
track to a first state, means for commencing background staging of the metadata 
tracks and means for changing the state of the parameter to a second state when a 
track is staged. If a request is received to access a track of copy state data and the 
corresponding parameter indicates that the track has been staged, the track is 
accessed. If the corresponding parameter indicates that the requested track has not 
been staged, the requester waits while the requested track is staged; then the 
requested track is accessed. Completion of the error handling routine is 
independent of the completion of the staging of copy state data tracks. 

[10] The present Invention further provides a copy services component of a data 
storage controller, including means for processing error handling routines. The 
means for processing error handling routines includes means for initializing a 
parameter of each metadata track to a first state, means for commencing 
background staging of the metadata tracks and means for changing the state of the 
parameter to a second state when a track is staged. If a request is received to 
access a track of copy state data and the corresponding parameter indicates that the 
track has been staged, the track is accessed. If the corresponding parameter 
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indicates that the requested tracic has not been staged, the requester waits while the 
requested track is staged; then the requested track is accessed. Completion of the 
error handling routine is independent of the completion of the staging of copy state 
data tracks. 

[11] The present invention further provides a data structure in a memory of a data 
storage controller, including a first field for storing a portion of a current state of an 
active copy operation, the portion being periodically destaged to a storage device. 
The data stmcture further includes a track state field having a first state indicative of 
invalid contents in the first field and a second state indicative of valid contents in the 
first field. Following commencement of an error handling operation, a background 
staging commences of the first field from the storage device to a memory device. 
When a request is received to access the first field, if the first field has been staged, 
access is allowed to the first field. If the first field has not been staged, a wait 
command is issued in response to the request to access the first field, the first field 
is staged, the wait command is revoked and the first field is accessed. 

[12] The present invention further provides a computer program product having 
computer-readable for initializing a storage controller. The computer-readable code 
includes instructions for commencing an IML, background staging copy state data 
tracks from a disk storage device to a memory device. If a request is received to 
access a track of copy state data and the track has been staged, the track is 
accessed. If the requested track has not been staged, requester waits while the 
requested track is staged; then the requested track is accessed. Preferably, 
completion of the IML is independent of the completion of the staging of copy state 
data tracks. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[13] Fig. 1 is a block diagram of a data storage system in which the present invention 
may be implemented; 

[14] Fig. 2 is a block diagram of a storage controller in which the present invention 

may be implemented; and 
[15] Figs. 3 and 4 are flow charts of one implementation of the present invention; 

4 

Docket: TUC920030181US1 
Express Mail Label: EV394802050US 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

[16] Fig. 2 is a block diagram of a primary data storage controller 200 in whicli the 
present invention may be implemented. The controller 200 is coupled through 
appropriate adapters or interfaces to one or more host devices and to one or more 
physical storage units 250, such as disk storage devices, and to a secondary 
storage controller. The controller 200 includes a memory device 210 and a 
processor 220. 

[17] The memory 210 includes an area in which metadata tracks 212 are stored. A 
first part 21 2A of each metadata track 212 is allocated to storing the current state of 
a copy services operation. A second part of each in-memory metadata track 212 is 
allocated to a track state field 21 2B. As will be described, each track state field 
21 2B contains a flag whose state (staged (valid) or unstaged (Invalid)) is indicative 
of the status of the metadata in the corresponding first part 21 2A of the track 212. 

[18] Similarly, the attached storage device 250 includes an area in which copies 252 
of the metadata tracks are stored in a non-volatile manner. 

[19] Referring also to the flow chart of Fig. 3, an implementation of the present 
invention will be described, under the control of program instmctions executed in the 
processor 220. A copy services operation (such as a Peer-to-Peer Remote Copy, a 
FlashCopy® or an Extended Remote Copy) commences (step 300) to copy customer 
data from the primary storage controller 120 to the secondary storage controller 150. 
The details of such copy operations are known in the art and covered by other IBM 
patents and will not be described herein. The current state of the copy services 
operation is stored as part of metadata 212 in the memory 210 (step 302). 
Periodically, the current state is destaged from the memory 210 to metadata tracks 
252 in the storage device 250. Such destaging may occur, for example, at regular 
time Intervals, when the copy state is updated (step 306) or at other designated 
times. Eventually, the copy services operation Is completed (step 308). 

[20] As noted above, there are circumstances, such as a power failure, software bug, 
hardware failure or other comparable significant event, which interrupt a copy 
services operation and prevent its completion. Rather than re-start the operation 
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from the beginning, furtlier delaying normal operation of the storage system, an error 
handling routine is initiated (Fig. 4, step 400). The error handling routine, also 
known as error handling behavior (EHB), may include an initial microcode load 
(IML). The track state fields 21 2B of the in-memory metadata tracks 212 are 
initialized to a first of two states (step 402) indicating that the contents of the first part 
21 2A of each track 212 (the part in which the copy state information Is stored) is 
'invalid'. Next, staging of the metadata tracks 252 from the storage device 250 to 
the memory 210 is begun (step 404). However, in contrast to conventional error 
handling routines, in the present invention the metadata tracks are staged in the 
background, without interrupting or othenwise delaying other aspects of the error 
handling routine, thus increasing the speed with which normal operations of the 
storage system may resume. When a metadata track 252 has been staged to the 
first part 21 2A of an in-memory track 212, the associated track state field 21 2B is 
changed to 'valid' (step 406) and a next track is staged. 
[21] Copy state information contained in the metadata tracks may need to be 
accessed during the error handling routine. If a request for a track is received (step 
408), the track state field 212B of the requested track is examined (step 410). If the 
field 21 2B is in the Valid' state, indicating that the metadata contents of the first part 
21 2A have been staged from the storage device 250, access to the contents is 
allowed (step 412). However, if the field 21 2B is in the 'invalid' state, indicating that 
the metadata contents of the first part 212A have not yet been staged from the 
storage device 250, access to the contents is not allowed. Instead, a 'waif 
command is issued (step 414), and the requested track is staged to the memory 210 
out of sequence (step 416). Once staged, the track state field is changed to 'valid' 
(step 418), the 'wait' command is revoked (step 420) and access is allowed (step 
412). 

[22] During the subsequent staging of the remaining metadata tracks 252 (step 422), 
any track which has previously been staged out of sequence, as indicated by the 
'valid' state of the track state field, will be skipped as re-staging is unnecessary. 

[23] The error handling routine may continue to completion (step 424) without waiting 
for the staging of metadata tracks to complete (step 422). Thus, completion of the 
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error handling routine and completion of staging the metadata tracks proceed 
independent of each other and normal customer I/O operations may commence as 
soon as the error handling routine is completed. 
[24] The described techniques may be implemented as a method, apparatus or 
computer program product using standard programming and/or engineering 
techniques to produce software, firmware, hardware, or any combination thereof. 
The computer program product (such as the operating memory 138), as used 
herein, refers to code or logic implemented in hardware logic (e.g., an integrated 
circuit chip, Programmable Gate Array (PGA), Application Specific Integrated Circuit 
(ASIC), etc.) or a computer readable medium (e.g., magnetic storage medium such 
as hard disk drives, floppy disks, tape), optical storage (e.g., CD-ROMs, optical 
disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs, 
PROMs, RAMs, DRAMs, SRAMs, firmware, programmable logic, etc.). Code in the 
computer readable medium is accessed and executed as instructions by a 
processor. The code in which implementations are made may further be accessible 
through a transmission media or from a file server over a network. In such cases, 
the computer program product in which the code is implemented may comprise a 
transmission media such as network transmission line, wireless transmission media, 
signals propagating through space, radio waves, infrared signals, etc. Of course, 
those skilled in the art will recognize that many modifications may be made to this 
configuration without departing from the scope of the implementations and that the 
computer program product may comprise any information bearing medium known in 
the art. 

[25] The objects of the invention have been fully realized through the embodiments 
disclosed herein. Those skilled in the art will appreciate that the various aspects of 
the invention may be achieved through different embodiments without departing 
from the essential function of the invention. The particular embodiments are 
illustrative and not meant to limit the scope of the invention as set forth in the 
following claims. 
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